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Crystal structure of insecticidal 5-endotoxin 
from Bacillus thuringiensis at 2.5 A resolution 
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The structure of the 5-endotoxin from Bacillus 
thuringiensis subsp. tenebrionis that is specifically 
toxic to Coleopterajnsects (beetle toxin) has been 
determined at 2.5 A resolution. It comprises three 
domains which are, from the N- to C-termini, a 
seven-helix bundle, a three-sheet domain, and a (3 
sandwich. The core of the molecule encompassing 
all the domain interfaces is built from conserved 
sequence segments of the active 5 -endotoxins. 
Therefore the structure represents the general fold 
of this family of insecticidal proteins. The bundle 
of long, hydrophobic and amphipathic helices is 
equipped for pore formation in the insect mem- 
brane, and regions of the three-sheet domain are 
probably responsible for receptor binding. 



The 5-endotoxins are a family of insecticidal proteins produced 
by Bacillus thuringiensis (B.t.) during sporulation, having relative 
molecular masses (M T ) 60,000-70,000 (60K-70K) in the active 
form and specific toxicities against insects in the orders of 
Lepidoptera, Diptera and Coleoptera 1,2 . These toxins have been 
formulated into commercial insecticides for three decades 3 , and 
now insect-resistant plants are engineered by transformation 
with Lepidoptera-specific toxin genes 4 " 6 . In the bacterium 5- 
endotoxins are synthesized as protoxins of M r s 70K-135K and 
crystallize as a parasporal inclusion ~1 fx in size, in which form 
they are ingested by the susceptible insect. The microcrystal 
dissolves in the alkaline pH of the midgut and the protoxin is 
cleaved by gut proteases to release the active toxin. 5-Endotoxins 
activated in vitro bind specifically and with high affinity (k D ~ 
0.1-20 nM) to protein receptors on brush-border membrane 
vesicles derived from the gut epithelium of target insects 7 " 9 and 
create leakage channels of 10-20 A diameter in the cell mem- 
brane 10 . In vivo such membrane lesions lead to swelling and 
lysis of the gut epithelium 11 and death of the insect ensues 
through starvation and septicaemia. Active 5-endotoxins of 
different specificities show five strongly conserved regions in 
their amino-acid sequences 1,12 . Exchanging sequence segments 
in the divergent regions between toxins of different specificities 
can produce active hybrids showing altered target 
specificity 13 "' 5 . We have determined the atomic structure of a 
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Coleoptera-specific 6-endotoxin (CrylllA, beetle toxin) from 
B.t. subsp. tenebrionis 1 *' 1 * to elucidate the structural basis for 
target specificity and membrane perforation by this family of 
proteins. 

Structure determination 

Parasporal crystals of the beetle toxin contain the full-length 
644-residue protoxin 17 as the minor component, and a product 
of bacterial processing with 57 residues removed from the N- 
terminus as the major component 19 . The latter (M r 67K) is 
similar in sequence to the active form of other 6 -endotoxins. 
After solubilization, papain cleavage converts the mixture to the 
67K toxin (see legend to Table I). This was recrystallized in the 
original crystal form of the parasporal crystals, space group 
C222 x and cell dimensions 1 17.1 by 134.2 by 104.5 A, containing 
one molecule per asymmetric unit and 55% solvent by volume 18 . 
Initial evaluation of derivatives was carried out at 4.5 A resol- 
ution with data collected on the FAST TV diffractometer 20 using 
CuKa radiation. Complete datasets (Table 1) were then collec- 
ted to 2.5 A resolution from native crystals using the imaging 
plate systems at the EMBL outstation at DESY and from the 
mercury and platinum derivatives on film at SRS Daresbury. 
The electron density map (Fig. 1) at 2.5 A resolution calculated 
with phases from multiple isomorphous replacement (mean 
figure of merit, 0.63) was easily interpretable and was improved 
by solvent flattening 21,22 . A continuous polypeptide chain from 
residue 61 to residue 644 at the C terminus was traced unam- 
biguously, and most side-chain atoms could be located in the 
map. The atomic model was built using the graphics program 
O (ref. 23) and had an initial K-factor of 37% for all data to 
2.5 A. After preliminary refinement using the program X-PLOR 
(ref. 24), the current model, containing 584 amino acid residues 
and 40 bound water molecules, has an R-factor of 19.9% and 
r.m.s. bond length deviation of 0.017 A. 

Description of the structure 

Overview. The beetle toxin is a wedge-shaped molecule with a 
radius of gyration of 58 A. As shown in Fig. 2a, it comprises 
three domains. Domain I, from the N terminus of the 67K toxin 
to residue 290, is a seven-helix bundle in which a central helix 
is completely surrounded by six outer helices tilted at about 
+20° to it (Fig. 3b,c). Domain II, from residues 291 to 500, 
contains three antiparallel p sheets packed around a hydro- 
phobic core with a triangular cross-section (Fig. 4). Domain III, 
from residues 501 to 644 at the C terminus is a sandwich of two 
antiparallel /3 sheets (Fig. 5). Domains I and III make up the 
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TABLE 1 Data collection and phasing statistics 



Data collection 



Data 

Native 

CH 3 HgN0 3 

Hg(CH 3 C00) 2 

c/s-Pt(NH 3 ) 2 CI 2 

K 2 0s0 4 

HoCI 3 

Phasing statistics 

Derivative 

CH 3 HgN0 3 

Hg(CH 3 C00) 2 

c/s-Pt(NH 3 ) 2 CI 2 

K 2 0s0 4 

H0CI3 



Method of 




Number of 


Resolution 


Number of 


Unique reflections 




collection 




crystals 


(A) 


measurements 


(% completeness) 


0 

"merge 


image plate 




8 


2.5 


121,767 


27,727 (100) 


0.108 


film 




7 


2.5 


4 A<j COO 

103.623 


**%*7 ?C7 M r\r\\ 

27, (Of (100) 


0.095 


film 




5 


2.5 


60,224 


25919 (94 5) 


0.103 


film 




7 


2.5 


86,629 


25924 (94 5) 


0.107 


FAST 




1 


4.5 


21.143 


4 680 (100) 

HjV/UV/ \ X.\J\J f 


0.077 


FAST 




1 


4.5 


20.013 


4 701 (100) 


0.069 














Phasing powerS 


Anomalous data Number of sites 


fl<Jeriv T 




(resolution, A) 




no 




3 


0.183 


0.715 


1.56 (2.5) 




yes 




6 


0.247 


0.609 


2.28 (2.5) 




no 




5 


0.185 


0.682 


1.54 (2.5) 




no 




4 


0.149 


0.757 


1.26 (5.5) 




no 




3 


0.095 


0741 


1.35 (5.0) 



Protein preparation: Solubilized parasporal crystals from BX subsp tenebhonis were incubated at 0.5mgml _1 protein with 0.125 units per ml of 
Agarose-linked papain (Boehringer) in 3.3 M NaBr. 0.05 M sodium phosphate, pH7.0. and 0.1 mgml 1 phenylmethylsulphonylfluoride (PMSF) for 30min at 
20 °C. Digestion was stopped by adding tosyl fysinechloromethylketone (TLCK) to 0.125 mgm 1 and Na 2 C0 3 to one fifth volume and removing the 
enzyme-beads. The 67K beetle toxin was then purified by gel filtration on Sephadex G75 equilibrated with 0.1 M NaHC0 3 , pH 10.5. 0.5 M NaBr. Crystallization: 
Single crystals were obtained by microdialysis at a protein concentration of 2.5 mgml" 1 against 0.1 M NaHC0 3 . pH9.5. 1.2 M NaBr at 4°C overnight, then 
against 0.1 M NaHC0 3 , pH 9.2, 0.5 M NaBr at 16 °C; 3 mM NaN 3 , 0.1 mM PMSF and 0.1 mg ml 1 TLCK were present in all buffers. Crystals were transferred 
by stages to 0.05 M 2-(A/-morpholino)ethanesulphonic acid (MES), pH6.5. for derivative preparation and mounted in 0.03% low-melting agarose in this buffer 
during data collection. Data collection: Image plate and film data were processed using MOSFLM (Imperial College. London) and CCP4 programs (Daresbury. 
UK). FAST (ref . 20) data were collected and processed with MADNES 45 . and scaled in 3° batches. Derivatives: Crystals were soaked respectively in 0.25 mM 
CH 3 HgN0 3 for 3.5 h, in 1 mM Hg(CH 3 C00) 2 for 14 h. in freshly prepared 1 mM c/s -Pt(NH 3 ) 2 CI 2 for 21 h. in saturated K 2 0s0 4 for 35 h. and in 2 mM HoCI 3 for 
3 days. Phase calculation: Two heavy-atom sites in each derivative were located from difference Patterson functions, except in the case of Hg(CH 3 CO0) 2 
for which 3 sites were located, and the remaining sites were found by cross-phased difference Fouriers. Heavy-atom parameters were refined against 
centric data and phases calculated for all data using the program PHARE (G. Bricogne). The two low-resolution derivatives were refined against phases 
calculated from the high-resolution derivatives. Phasing with the three high-resolution derivatives gave an overall figure of merit of 0.61 (25-2.5 A) and a 
clearly interpretable map. Including the remaining derivatives slightly improved the connectivity of the map (overall figure of merit 0,63). and four cycles of 
solvent flattening using a 50% solvent content and a 9 A radius in mask calculation 21 27 improved the overall definition of densities. The starting model 
was built using the program 0 (ref. 23) with the Bones option for main-chain tracing and the autobuild and manip options for side chains. Refinement by 
simulated annealing using the program X-PLOR (ref. 24) reduced the R-factor from 0.37 to 0.25 without individual 6-factors. and to 0.23 with restrained 
individual B-factors. The model was adjusted in the loops 154-156, 429-436. and 483-488, and had 40 solvent molecules added, then refined by X-PLOR 
again. The current model has an fl-factor of 19.9%. with r.m.s. bond length deviation of 0.017 A, r.m.s. bond angle deviation of 3.2°. and average atomic 
B-factor of 18 A 2 

* Emerge = 1 1- 1'/ - (OUZ !<0|. where I, are intensity measurements for a reflection, and </) is the mean intensity for this reflection. 

* ff<jeriv=I \K*-F*VZ I^pI- wnere is tr, e structure factor amplitude of the derivative crystal and F P is that of the native. 

+ flcuiiis = 1 1 I^ph ± r>| ~ /^calcJl/V - F P \, where F P and F PH are defined as for iv . and F„(ca1c) is the calculated heavy-atom structure factor amplitude 
summed over centric data only. 
§ Phasing power ~<F H )/E, the r.m.s. heavy-atom structure factor amplitudes divided by the residual lack of closure error. 



bulky end of the molecule. Through their contact one of the 
two p sheets in domain III is almost entirely buried. To our 
knowledge (see, for example, ref. 25), the packing of helices in 
domain I and of sheets in domain II are both novel arrange- 
ments. 

Domain I. The central helix in this seven-helix bundle is a 5 (Fig. 
36,c), which is oriented with its C terminus towards the bulky 
end of the molecule. Viewed from this end, the outer helices 
are arranged anticlockwise in the order of a lt a 2 > a 3 , a 4 , a 6 
and a 7 , with helices a, and a 7 adjacent to the /3-sheet domains; 
a 2 is interrupted by a non-helical section and only the leading 
half, «2a» is packed against <x 5 . Figure 3a shows the alignment 
of amino-acid sequence on the surfaces of the helices. The 
helices are long, especially a 3 to a 7 , which contain respectively 
8, 7, 6, 9 and 7 complete helical turns and hence would be long 
enough to span the 30-A thick hydrophobic region of a mem- 
brane bilayer. Furthermore, the six outer helices bear a strip of 
hydrophobic residues (defined by AG 2*0 for transfer from oil 
to water) down their entire length on the side-facing helix a 5 , 
so they are amphipathic. In keeping with the general observation 
that secondary structures are close-packed and bury hydro- 
phobic surfaces 26 , the helix contact angles in this domain cluster 
around +20° rather than -50°, giving the bundle a bouquet-like 
appearance (Fig. 36). Figure 3c shows the bundle in cross- 
section. The interhelical space contains 27 aromatic residues 
which are packed in the edge-to-face fashion 27 ; all polar groups 
in this region are hydrogen-bonded or in salt bridges. 
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The concentric arrangement of the seven-helix bundle is dis- 
tinct from the two-layered type seen in bacteriorhodopsin. There 
is some resemblance to the pore-forming domain of colicin A 28 , 
in which two hydrophobic helices are shielded from solvent by 
eight amphiphilic helices, but the colicin helices are generally 
shorter. Like the colicin helices, the bundle in the beetle toxin 
may be a soluble form of packaging for the hydrophobic and 
amphiphilic helices that will form pores in the membrane after 
a large change in conformation. 

Domain II. In Fig. 4a and 46 the three sheets of this domain are 
laid side-by-side, as they would be seen from the solvent. There 
is an apparent structural duplication between the four-stranded 
antiparallel sheets, sheet 1 and sheet 2. The chain connections, 
Pa, Pi, Ps and 0 8 , /3 7 , 0 6 , 0 9 , respectively, follow the order 
of +3, - 1 , - 1 , +3, which is typical of the 'Greek-key' topology 29 . 
From both sheets the inner strands, 0 3 and p 2 as well as p 7 and 
/3 6 , extend some 20 A to the apex of the molecule as two- 
stranded p ribbons; and at the point of departure from the 
sheets there is a p -bulge in /3 3 and in p 7 to twist the plane of 
the ribbon by nearly 90° relative to the sheet. The connections 
between the outer strands cross over the ribbons on the solvent 
side. 

The pseudo-symmetry between these sheets is very approxi- 
mate. Using the least squares option in O (ref. 23), the sheet 
region of the strands 0 3 and p 2 can be brought to superimpose 
on that of p n and p 6t with a r.m.s. fit of 0.72 A for 13 a carbons. 
But the r.m.s. fit increased to 1.1 A for 23 a carbons of the 
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FIG. 1 Electron density map in the neighbourhood 
of Cys 243, calculated a, using combined phases 46 
from multiple isomorphous replacement and sol- 
vent flattening, and b. using combined experi- 
mental and model phases 46 after refinement by 
X-PLOR. The refined structure is shown superim- 
posed for reference. Although Cys 243 is a major 
site of both the methylmercury (MM) and mercuric 
acetate (MA) derivatives, the methyl mercury site 
is in a hydrophobic environment compared with 
the mercuric acetate site. 













1 



whole inner strands including the ribbon region, and 1.7 A for 
36 a carbons on all four strands. Nonetheless, the sequence 
alignment brought by this superposition of the two sheets 
revealed a low level of internal homology, with seven pairs of 
equivalent residues (shown in bold) out of 41 aligned a carbons: 

338 HRIQPHTRFQJ , (6)S7NYWS(l)HyVSTRPSI(0)GSHDIITSPF(10)HLKPH 395 
402 AVAUTNLAVWP(0)SAVYSG(1)TKVEPSQYU(3)DBASTQTYDS(7 )SWTSI 453 

The three-stranded sheet 3 is formed by two separate polypep- 
tide segments. The C-terminal segment of domain II contributes 
the two-stranded ribbon of p l0 and p Ut whereas the N-terminal 
segment of this domain contributes strand p x , which is hydro- 
gen-bonded to f$ u ; Pi is followed by a two-turn helix a 8 and 
an extended chain. 

Figure 4c and d shows in side view and in cross-section that 
the three antiparallel sheets are packed around a triangular 
hydrophobic core. This brings the strand p l0 on the edge of 
sheet 3 into proximity with strand /3 4 on the edge of sheet 1, as 
well as placing the loops at the end of the three /3 ribbons into 
a region of about 12 A radius at the molecular apex. This domain 
is in contact with helix a n of domain I on the face of sheet 3 
(Fig. 4c). 

Domain III. Figure 5 is a ribbon drawing of the strands forming 
the two sheets of the p sandwich. The sheet containing the 
C-terminal strand is in contact with domain I and will be called 
the inner sheet. This domain has the 'jelly-roll* topology 29 , 
because it can be generated by folding an antiparallel P ribbon 
which starts with p l3 (N terminus) and P& (C terminus) on the 
inner sheet, and ends in the loop between /3 18 and p l9 on the 
outer sheet; p l4 is a short excursion from this ribbon and forms 
the fifth antiparallel strand of the outer sheet. In addition, small 
parallel sheets are formed at the edge of the p sandwich through 
hydrogen bonding of strand p l2 to p l6 at the edge of the outer 
sheet, and p x to /3, 3 at the edge of the inner sheet. 
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Distribution of conserved sequences. The core of the beetle toxin 
molecule encompassing the domain interfaces is built from the 
five sequence blocks that are highly conserved throughout the 
5-endotoxtn family 1 (Fig. 2fc,c). Block 1, located in the beetle 
toxin sequence at residues 189-218, corresponds to the central 
helix (<* 5 ) of the bundle in domain I. Block 2, residues 239-305, 
overlaps with the latter half of or 6 , and with a 7 and jS,; the 
latter hydrogen-bonds to the edge of the inner sheet in domain 
III before forming part of the three-stranded sheet 3 in domain 
II. Block 3, residues 491-538, overlaps with the latter part of 
0,,, where it is hydrogen-bonded to p lt and with the loops 
connecting domains II and III. The remainder of block 3 
together with blocks 4 and 5, namely residues 560-569 and 633 
to the C terminus, respectively, constitute the three buried 
strands of the inner antiparallel sheet in domain III. The high 
degree of conservation of internal residues implies that 
homologous proteins would adopt a similar fold. Using the 
beetle toxin structure as a model, we can therefore propose a 
basis for the insecticidal activity of 5-endotoxins as a family. 

Basis of insecticidal function 

Solubility. The beetle toxin crystals are isomorphous with the 
parasporal crystals 18,19 and show the molecular contacts respon- 
sible for solubility behaviour hi vivo. Four intermolecular salt 
bridges, Asp 142-Arg 165, Asp 224- Arg 562, Asp 590- Arg 178, 
and Glu 223-Lys293, are located at contacts to three different 
neighbouring molecules. Such salt bridges keep the protoxin 
crystals insoluble until exposed to the extreme pHs in the insect 
midgut. 

Proteolytic activation. Pro-6 -endotoxins have M r s of either 
— 130K or —70K. Activation by larval gut proteases removes 
the C-terminal half of the larger protoxins 30 31 and cleaves them 
at residue 28 or 29 from the N terminus. The smaller protoxins, 
such as that of the beetle toxin, are processed only at the N 
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FIG. 2 Overview, a, Schematic ribbon 
representation of the beetle toxin 
showing the domain organization. 
Secondary structure assignments are 
given by Yasspa within program 0 (ref. 
23). The polypeptide pathway is indi- 
cated by colouring the chain in the rain- 
bow order, from red at the N terminus 
to blue at the C terminus. The three 
domains are: I, a seven-helix bundle 
(upper left); II, a three-sheet assembly 
(bottom); and III, a 0 sandwich (upper 
right). This and all following illustrations 
of the structure are made with the 
program MOLSCRIPT 47 . 6 and c, Ca 
trace (stereoview) of the molecule with 
the five conserved sequence blocks 
indicated by small beads at their Ca 
positions. In 6 the view is as in a, and 
in c it is down the central helix of the 
bundle from the bulky end of the 
molecule; c shows that the central helix 
of domain I and the inner sheet of 
domain III are conserved; b shows that 
the helices at the domain l-ll interface 
and the loops at the domain ll-lll inter- 
face are also conserved. Note in c the 
helix packing of six around one in 
domain I. 4 The solvent channel in the 
C222 ± lattice viewed along the c axis. 
One half of the unit cell thickness is 
shown, containing four molecules. The 
other half of the cell is related to this 
by a two-fold rotation about horizontal 
axes (blue lines) at (§, y, ±|). The stack- 
ing of both layers leaves solvent chan- 
nels that traverse the cell along the c 
direction. The N terminus of the 
molecule (arrow) is accessible from 
these channels. 
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FIG. 3 The seven-helix bundle, a. Helical nets 
showing the position of amino-acid residues along 
the 7 helices: aj (63-79); a 2 (a 2a . 85-98 and 
a 2b . 104-117), <r 3 (123-152). a A (160-185), « 5 
(193-214), q 6 (222-254) and a 7 (259-285). The 
cylindrical surface of the helices are cut longi- 
tudinally on the side facing the solvent and flat- 
tened to give a view from the interior of the bundle. 
The top of the drawing corresponds to the bulky 
end of the whole molecule. Owing to tilting of the 
outer helices, different helices are in register verti- 
cally only at a level indicated by two arrows pointed 
at a 1 and a 7 ; a 5 is the central helix. Dotted curves 
outline the strip of hydrophobic residues down the 
inward surface of the other six helices. 0. Ca trace 
(stereoview) for the bundle viewed perpendicular 
to « 5 . The relative tilt of the outer helices to a 5 
and that between adjacent outer heleices are both 
about 20°. The Ca trace is shaded grey over 
helices ol to «3 in the back, striped over helix 
a 5 in the centre, and white over helices a 4. a 5, 
and a 7 in the front, a Cross-section of the bundle 
at the level indicated by the arrows in a. viewed 
from the bulky end of the molecule. The hellical 
backbone is represented by curly ribbons passing 
through the Ca positions. The outer helices are 
positioned roughly hexagonally around the central 
one and tilted relative to it. so the bundle forms 
a left-handed superhelix. The aromatic side chains 
are packed in an edge-to-face fashion. Hydrogen 
bonds are shown for side-chain atoms. 






terminus 19,32 where about 50 residues are removed. The activated 
S-endotoxins show a conserved C-terminus, so-called sequence 
block 5 (ref. 1). Its position as the middle strand of the buried 
p sheet in domain III precludes further processing from the C 
terminus. In fact deletion from this site by 4 to 8 residues results 
in inactive mutants with altered solubility and immuno- 



genicity 



5 . This is not surprising as the inner sheet can be 



expected to play a critical part in the structural integrity and 
stability of the toxins through interaction with the helical bundle. 

At the N-terminal cleavage sites the different protoxin 
sequences show locally similar hydropathy profiles 36 3 \ which 
would be consistent with a common topology for the N-terminal 
region of the activated toxins as seen in the helical bundle of 
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the beetle toxin. In crystals of the beetle toxin, the N terminus 
at the start of helix a x borders on a large solvent channel of 
about 30 A diameter that crosses the unit cell along the c direc- 
tion (Fig. 2d). This channel could allow access of sporulation- 
associated proteases to the cleavage site in parasporal crystals' 9 . 
Receptor binding. The insecticidal selectivity of S-endotoxins is 
due to high-affinity binding to specific membrane receptors 7 " 9 - 38 , 
which in three cases seem to be glycoproteins 38 " 40 . For several 
6-endotoxins the specificity-determining regions have been 
delimited by exchanging sequence segments between closely 
related toxins of differing specificities 13 " 15 . Guided by the loca- 
tion of secondary structures in the beetle toxin, a plaus- 
ible alignment of 5-endotoxin sequences was made for the non- 
819 



© 1QQ1 Nature PnhliQhinnftrniin 



ARTICLES 



BEST AVAILABLE COPY 




Sheet 1 



Sheet 2 



Sheet 3 
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FIG. 4 The three sheets of domain II. a, Schematic 
ribbon drawing of sheets 1. 2, and 3 laid side-by- 
side. Each is viewed from the exterior of the 
domain. Note the Greek-key topology of sheets 1 
and 2 and the similarity between their fold. b. 
Hydrogen bonding of the polypeptide backbone for 
the three sheets. The 0 strands are shown by the 
main-chain atoms and by the residue numbers at 
their ends; connecting strands are shown as coils. 
c, Ca trace of the three assembled sheets in 
domain II viewed towards domain I (stereoview). 
The Ca trace is shaded grey over sheet I. striped 
over sheet II, and white over sheet III. d. Cross- 
section of domain II (stereoview) showing the 
packing of three sheets in a triangle around the 
hydrophobic core. The view is towards domain III. 



conserved regions (ref. 12, and T. C. Hodgman, unpublished 
results). Hence the genetically identified specificity-determining 
regions can be mapped to equivalent positions in the beetle 
toxin structure, and these fall mainly in domain II. For instance, 
the dual specificity of CryllA for Lepidoptera and Diptera, as 
distinct from the Lepidoptera specificity in the closely related 
CryllB, is determined by residues 307-382 of their sequences 14 , 
which corresponds roughly to sheet 1 (Fig. 4a) plus strand /3 6 
in sheet 2 and the loop leading up to p 7 , whereas the Lepidoptera 



820 



specificity of CryllB is dependent on a longer segment 14 that 
would include both inner strands of sheet 2. Similarly, the 
toxicities of CrylA(a) and CrylA(c) to two lepidopteran insects 
depend on three segments termed x, y and z (ref. 15): amino-acid 
substitutions in y can reduce toxicity by up to 2,000-fold, and 
segments x and y interact in determining specificity. Aligned 
with the beetle toxin structure, segment x corresponds roughly 
to the outer strands fi A and 0 5 of sheet 1 and the whole of sheet 
2, including the loop entering 0, o in sheet 3; y corresponds to 
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Domain III 

FIG. 5 Domain III, schematic ribbon representation of the p sandwich, p 
strands forming the inner sheet are shaded grey. The topology of an 
eight-stranded *jelly-roir can be seen by following the p hairpin starting 
with p 13t p ls and p 23 in the inner sheet, continuing to p 16 and p 22 in the 
outer sheet, then p 17 and p 21 • £20 ' n tne inner sheet, and ending with p 1B 
and p 19 in the outer sheet. p 1A is an excursion from the hairpin and forms 
a fifth antiparallel strand of the outer sheet. Small parallel p sheets are 
added to one edge of the p sandwich, by hydrogen bonding of P x to p l3 
in the inner sheet and p 12 to p 1G in the outer sheet. Residue numbers in 
the P strands are: p 12 , 502-506; p 13 . 509-513; /3 14 . 519-525; 0 15 . 
536-541; P 16 , 547-554; p 17 , 558-569; p ia , 573-579; p 19 , 585-591; 
£20, 604-609; p 21 , 611-614; P 22 , 619-625; and P 23 , 631-643. 

strand p l0 of sheet 3 and the loop connecting 0 1O and p xx ; and 
z extends from P u to the C-terminal activation site. Furthermore, 
the interaction between x and y can be understood in terms of 
the proximity between p 4 on the edge of sheet 1 and p l0 on the 



edge of sheet 3. Although z was inferred 15 to extend into 
domain III, the combined evidence from genetics and receptor- 
binding assays in vitro for Lepidoptera toxins 9,41 correlates 
receptor recognition with sequence variations within domain 
II. We note that the p ribbons from all three sheets terminate 
in loops in a small region on the molecular apex, in a man- 
ner reminiscent of the complementarity-determining region of 
immunoglobins. 

Pore formation. The common mechanism of epithelial cell disrup- 
tion by 5-endotoxins of widely different specificities is believed 
to be the formation of lytic pores of 10 to 20 A diameter in the 
insect membrane 10 . The structure of the beetle toxin displays 
an apparatus for pore formation in the long, hydrophobic and 
amphipathic helices of domain I which could penetrate the 
membrane. Between the crystal structure in which the bouquet- 
like helical bundle internalizes all the hydrophobic surfaces, 
and the unknown pore structure where hydrophobic surfaces 
would be in intimate contact with the membrane lipids, large 
conformation changes must occur. In the absence of a full 
characterization of the pore-forming process, we propose the 
following by extrapolation from the crystal structure. 

The trigger for the conformational changes may be provided 
by receptor binding and the consequent interaction of toxin 
with the membrane bilayer. Membrane insertion follows rapidly, 
so that a major part of the bound 5-endotoxin cannot be dis- 
placed from the brush-border vesicles by other toxins recogniz- 
ing the same receptor sites 7,9 . As domain II and probably its 
apical region are most likely to bind the membrane receptors, 
the helices are expected to insert with the 'domain II end' (see 
Fig. 2a) oriented towards the cytoplasm. If helical hairpins are 
to initiate the membrane penetration, as probably happens for 
colicin 28,42,43 , they will probably be linked at the domain II end. 
So either of the helix pairs a 6 -or 7 or a 4 -a 5 could be the likely 
initiator. The ot 6 -<x 7 pair is favoured because it forms part of 
the conserved interface with domain II and is well positioned 
to sense the receptor binding. On the other hand, helix a 5 is 
the most conserved throughout the family of 5-endotoxins. Point 
mutations in a 5 reduce toxicity of a Lepidoptera toxin without 
reducing binding to membranes 44 . Proteolysis in the interhelical 
loops at the domain III end, as in the a 3 -a 4 loop 19,32 , may 
facilitate release of the helix pairs from the tertiary structure of 
the bundle. The insertion of a hairpin can create a defect in the 
membrane, allowing the rest of domain I to participate in pore 
formation in a cooperative manner. □ 
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